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SPEECH SIGNAL DECODING METHOD AND APPARATUS, 
SPEECH SIGNAL ENCODING/DECODING METHOD AND APPARATUS, 
AND PROGRAM PRODUCT THEREFOR 
FIELD OF THE INVENTION 

This invention relates to a method of encodingand decoding 
a speech signal at a low bit rate. More particularly, the 
invention relates to a speech signal decoding method and 
apparatus, a speech signal encoding/decoding method and 
apparatus and a program product for improving the quality of 
sound in noise segments. 
BACKGROUND OF THE INVENTION 

A method of encoding a speech signal by separating the 
speech signal into a linear prediction filter and its driving 
excitation signal (excitation signal, excitation vector) is 
used widely as a method of encoding a speech signal efficiently 
at medium to low bit rates. One such method that is typical is 
CELP (Code-Excited Linear Prediction). With CELP, a linear 
prediction filter for which linear prediction coefficients 
representing the frequency characteristic of input speech have 
been set is driven by an excitation signal (excitation vector) 
represented by the sum of a pitch signal (pitch vector), which 
represents the pitch period of speech, and a sound source signal 

(sound source vector) comprising a random number or a pulse train, 
whereby there is obtained a synthesized speech signal 

(reconstructed signal, reconst ructed vector) . At th i s t i me the 
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pitch signal and the sound source signal are multiplied by 
respective gains (pitch gain and sound source gain). For a 
discussion of CELP, see the paper (referred to as "Reference 1") 
"Code excited linear prediction: High quality speech at very 
5 low bit rates" by M. Schroeder et. al (Proc. of IEEE Int. Conf. 
on Acoust., Speech and Signal Processing, pp. 937 - 940, 1 985). 

Mobile communication such as by cellular telephone 
requires good quality in a noisy environment typified by the 
congestion of busy streets and by the interior of a traveling 

10 automobile. A problem with CELP-based speech encoding is a 
marked decline in sound quality for speech on which noise has 
been superimposed (such speech will be referred to as 
"background-noise speech" below). 

A method of smoothing the gain of a sound source in a decoder 

15 is an example of a known technique for improving the encoded 
speech qual ity of background-noise speech. In accordance with 
this method, a temporal change in short-term average power of 
a sound source signal that has been multiplied by the aforesaid 
sound source gain is smoothed by smoothing the sound source gain. 

20 As a result, a temporal change in short-term average power of 
the excitation signal also is smoothed. This method improves 
sound quality by reducing extreme fluctuation in short-term 
average power in decoded noise, which is one cause of degraded 
sound quality. 

25 With regard to a method of smoothing the gain of a sound 
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source signal, see Section 6.1 of "Digital Cellular 
Telecommunication System; Adaptive Multi-Rate Speech 
Transcoding" (ETS I Technical Report, GSM 06.90 version 2.0.0) 
(Referred to as "Reference 2"). 
5 Fig. 8 is a block diagram illustrating an example of the 

structure of a conventional speech signal decoder which improves 
the encoded quality of background-noise speech by smoothing the 
gain of a sound source signal. It is assumed here that input 
of a bit sequence occurs in a period (frame) of T fr msec (e.g., 

10 20 ms) and that computation of a reconstructed vector is 

performed in a period (subframe) of T fr /N sfr msec (e.g., 5 ms) , 
where N sfr is an integer (e.g., 4). Let frame length be L fr 
samples (e.g., 320 samples) and let subframe length be L sfr 
samples (e.g., 80 samples). The numbers of these samples is 

15 decided by the sampling frequency (e.g., 16 kHz) of the input 
speech signal. 

The components of the conventional speech signal decoder 
will be described with reference to Fig. 8. 

The code of the bit sequence enters from an input terminal 
20 10. A code input circuit 1010 splits the code of the bit 

sequence that has entered from the input terminal 10 and converts 
it to indices that correspond to a pluralityof decodeparameters. 
An index corresponding to a line spectrum pair (LSP) which 
represents the frequency characteristic of the input signal is 
25 output to an LSP decoding circuit 1020, an index corresponding 
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to a delay L pd that represents the pitch period of the input 
signal is output to a pitch signal decoding circuit 1210, an 
index corresponding to a sound source vector comprising a random 
number or a pulse train is output to sound source signal decoding 
5 circuit 1110, an index corresponding to a first gain is output 
to a first gain decoding circuit 122 0, and an index corresponding 
to a second gain is output to a second gain decoding circuit 1120. 

The LSP decoding circuit 1020 has a table (not shown) in 
which multiple sets of LSPs have been stored. The LSP decoding 

10 circuit 1 020 receives as an input the index that is output from 
the code input circuit 1010, reads the LSP that corresponds to 
this index out of the table and obtains LSP ~q, (Nsfr) (n) in the 
N sfr th subframe of the present frame (the nth frame), where N D 
represents the degree of linear prediction. 

15 The LSP of an (N sfr -1)th subframe from the first subframe 

is obtained by linearly interpolating " q, (Nsfr) (n) and S sfr (i) 
(whe r e i =0, ■■■ , L s f ) . 

LSP ~ q j 1 N s f r 1 (n) (where j =1 , •■• , Np, m = 1, ••■ , N s f r ) is output 
to a linear prediction coefficient conversion circuit 1030 and 

20 to a smoothing coefficient calculation circuit 1310. 

The linear prediction coefficient conversion circuit 1030 
receives as an input a signal output from the LSP "q/" 1 ' (n) 
(where j = 1 , ■■• , Np, m=1 , ••• , N s f r ) decoding circuit 1 020. 

The linear prediction coefficient conversion circuit 1030 

25 converts the entered LSP "q J lB) (n) to a linear prediction 
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coefficient ~ a i ( ■ 1 (n) (where j=1, — , Np, ra = 1, N sfr ) and 
outputs "oj (n) to a synthesis filter 1 040. A known method 
such as the one described in Section 5. 2. 4 of Reference 2 is used 
to convert the LSP to a linear prediction coefficient. 
5 The sound source signal decoding circuit 1110 has a table 

(not shown) in which a plurality of sound source vectors have 
been stored. The sound source signal decoding circuit 1110 
receives as an input the index that is output from the code input 
circuit 1010, reads the sound source vector that corresponds to 

10 this index out of the table and outputs this vector to a second 
ga i n circuit 1130. 

The second gain decoding circuit 1120 has a table (not 
shown) in which a plurality of gains have been stored. The 
second gain decoding circuit 1120 receives as an input the index 

15 that is output from the code input circuit 1010, reads a second 
gain that corresponds to this index out of the table and outputs 
this gain to a smoothing circuit 1320. 

The second gain circuit 1130, which receives as inputs the 
first sound source vector output from the sound source signal 

20 decoding circuit 1110 and the second gain output from the 

smoothing circuit 1320, multipl ies the first sound source vector 
by the second gain to generate a second sound source vector and 
outputs the second sound source vector to an adder 1050. 

A memory circuit 1240 holds an excitation vector input 

25 thereto from the adder 1050. The memory circuit 1240, which 



holds the excitation vector applied to it in the past, outputs 
the vector to a pitch signal decoding circuit 1210. 

The pitch signal decoding circuit 1210 receives as inputs 
the past excitation vector held by the memory circuit 1240 and 
the index output from the code input circuit 1010. The index 
specifies a delay L pd . In regard to this past excitation vector, 
the pitch signal decoding circuit 1210 cuts vectors of L sfr 
samples corresponding to the vector length from a point L pd 
samples previous to the starting point of the present frame and 
generates a fi rst pitch signal (vector). I n case of ~ a i ' ra 1 (n) , 
the pitch signal decoding circuit 1210 cuts out vectors of L pd 
samples, repeatedly connects the L pd samples and generates a 
first pitch vector, which is a sample of vector length L sfr . The 
pitch signal decoding circuit 1210 outputs the first pitch 
vector to a first gain circuit 1230. 

The first gain decoding circuit 1220 has a table (not shown) 
in which a plurality of gains have been stored. The first gain 
decoding circuit 1220 receives as an input the index that is 
output from the code input circuit 1010, reads a first gain that 
corresponds to this index out of the table and outputs this gain 
to the first gain circuit 1 230. 

The first gain circuit 1230, which receives as inputs the 
first pitch vector output from the pitch signal decoding circuit 
1210 and the first gain output from the first gain decoding 
circuit 1220, multiplies the entered first pitch vector by the 



first gain to generate a second pitch vector and outputs the 
generated second pitch vector to the adder 1050. 

The adder 1050, to which the second pitch vector output from 
the first gain circuit 1230 and the second sound source vector 
output from the second gain circuit 1130 are input, adds these 
inputs and outputs the sum to the synthesis filter 1040 as an 
exc i tat ion vector. 

The smoothing coefficient calculation circuit 1310, to 
which LSP ~q } (m) (n) output from the LSP decoding circuit 1020 
is input, calculates an average LSP ~~q 0j (n) in the nth frame 
in accordance with Equation (1) below. 

q oj (n) = 0.84 • q 0j (n - 1) + 0. 16 ■ (n) - ■ -(1) 

Next, with respect to each subframe m, the smoothing 
coefficient calculation circuit 1310 calculates the amount of 
fluctuation d 0 (m) of the LSP in accordance with Equation (2) 
below. 

|q 0j (n)-qf>(n)| 

j=i q 0j (n) 

A smoothing coefficient k 0 (m) in the subframe m is 
calculated in accordance with Equation (3) below. 

k 0 (m) =min (0. 25, max (0, d 0 (m) -0. 4) ) /0. 25 - (3) 
where m i n Cx, y) is a function in which the smaller of x and y is 
taken as the value and max (x, y) is a function in which the larger 
of x and y is taken as the value. The smoothing coefficient 
calculation circuit 1310 finally outputs the smoothing 



coefficient k„ (m) to the smoothing circuit 1320. 

The smoothing coefficient k„ (m) output from the smoothing 
coefficient calculation circuit 1310 and the second gain output 
from the second gain decoding circuit 1120 are input to the 
smoothing circuit 1320. The latter then calculates an average 
gain g 0 (in) i n accordance wi th Equat i on (4) be I ow f rom second 
ga i n ~g 0 (m) in subf rame m. 

go(m) = ^£g 0 (m-i) ■■■(4) 

-> i=0 

Next, second ga i n ~g 0 (m) i s subs t i tu ted i n accordance w i th 
Equation (5) below. 

So (m) = g 0 • k 0 (m) + g 0 (m) • (1 - k 0 (m)) ■ ■ -(5) 

Finally the smoothing circuit 1320 outputs the second gain 
g 0 (m) to the second gain circuit 1130. 

The excitation vector output from the adder 1050 and the 
I i near predict ion coef f icient ~ a i (m) (n) (where j = 1, Np, m=1, 
•••,N sfr ) output from the linear prediction coefficient 
conversion circuit 1030 are input to the synthesis filter 1040. 
The latter drives a synthesis f i Iter 1 /A (z) , for which the I i near 
prediction coefficients have been set, by the excitation vector 
to thereby calculate the reconstructed vector, which is output 
from an output terminal 2 0. The transfer function 1/A (z) of the 
synthesis filter is represented by Equation (6) below, where it 
is assumed that the linear prediction coefficient i s represented 
by a , (j = i, -, N p ) . 



l/A(z) = l/(l-£a i z i ) "-(6) 

Fig. 9 is a block diagram illustrating the structure of a 
speech signal encoder in a conventional speech signal 
encod i ng/decod i ng apparatus. The speech s i gna I encoderwill be 
described with reference to Fig. 9. It should be noted that the 
fi rst gain ci rcui t 1 230, the second gain circuit 1130, the adder 
1 050 and the memory circuit 1 240 are the same as those described 
in connection with the speech signal decoding apparatus shown 
in Fig. 8 and need not be described again. 

The encoder has an input terminal 30 to which an input 
signal (input vector) is applied, the input vector being 
generated by sampling a speech signal and combining a plurality 
of samples into one vector as one frame. 

The input vector from the input terminal 30 is applied to 
a linear prediction coefficient calculation circuit 5510, which 
proceeds to subject the input vector to linear prediction 
analysis and obtain linear prediction coefficients. A known 
method of performing linear prediction analysis is described in 
Chapter 8 "Linear Predictive Codingof Speech" in L. R. Rabiner 
et. al "Digital Processing of Speech Signals" (Prentice-Hall, 
1978) (referred to as "Reference 3"). 

The linear prediction coefficient calculation circuit 5510 
outputs the linear prediction coefficients to an LSP 
conversion/quantization circuit 5520. 
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Upon receiving the linear prediction coefficients output 
from the linear prediction coefficient calculation circuit 5510, 
the LSP conversion/quantization circuit 5520 converts the 
linear prediction coefficients to an LSP and quantizes the LSP 
to obtain a quantized LSP. An example of a well-known method 
of converting linear prediction coefficients to an LSP is that 
described in Section 5.2.3 of Reference 2. An example of a 
method of quantizing an LSP is that described in Section 5. 2. 5 
of Reference 2. 

As described in connection with the LSP decoding circuit 
of Fig. 8, the quantized LSP is assumed to be a quantized LSP 
"q, 1Nsfr) ( n ) in the N sfr th subframe of the present frame (the nth 
frame) (where j = 1, ■■■ Np) . 

The quantized LSP of an (N sfr -1)th subframe from the first 
subframe is obtained by linearly interpolating '" sfr) (n) 

and S s f r (i) (where j=1, Lsf) . Furthermore, this LSP is 
assumed to be LSP Q j (Nsfr ) ( n ) (j=1, ---Np) in the N sfr th subframe 
of the present frame (the nth frame). The LSP of the (N sfr -1) th 
subframe from the first subframe is obtained by linearly 
interpolating q.'^sfn (n ) and qj(N sfr) (n _ 1 j 

The LSP conversion/quantization circuit 5520 outputs 
LSPq, (ml (n) (where j = 1, - , Np, m = 1, - , N s f r ) and the quantized LSP 
~q, (m) (n) (where j=1, Np, m = 1, N s f r ) to a I i near pred i ct ion 
coefficient conversion circuit 5030 and outputs an index 
corresponding to the quantized LSP "q J (Hsfrl (n) (where j=1, 
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— ,Np) to a code output circuit 6010. 

The LSP q, (m) (n) (where j = 1 , ■•■ , Np, m= 1 , ••• , N s f r ) and the 
quantized LSP "q/™ 1 (n) (where j = 1 , - , Np, m=1 , ■■• , N s f r ) output 
from the LSP conversion/quantization circuit 5520 are input to 
the linear prediction coefficient conversion circuit 5030, 
which proceeds to convert q J ,l,) (n) to a linear prediction (LP) 
coefficient a i (m) (n) (where j = 1, Np, m = 1 ,—, N, f r ) , convert a 
j ln) (n) to a linear prediction coefficient ~ a , U) (n) (where j = 1, 
Np, m=1, N sfr ), output the linear predict ion coefficient a 
j {m) (n) to a weighting filter 5050 and to a weighting synthesis 
filter 5040, and output the linear prediction coefficient " 
a/ m) (n) to the weighting synthesis filter 5040. 

An example of a well-known method of converting an LSP to 
I inear pred i ct ion (LP) coefficients and converting a quantized 
LSP to quantized linear prediction coefficients is that 
described in Section 5.2.4 of Reference 2. 

The input vector from the input terminal 30 and the linear 
prediction coefficients from the linear prediction coefficient 
conversion circuit 5030 are input to the weighting f i Iter 5050. 
The latter uses these linear prediction coefficients to produce 
a weighting filter W (z) corresponding to the characteristic of 
the human sense of hearing and drives this weighting filter by 
the input vector, whereby there is obtained a weighted input 
vector. The weighted input vector is output to subtractor 5060. 
The transfer function W (z) of the weighting filter is 
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represented by Equation (7) below. 
W(z)=Q(z/r,)/Q(z/r.) - (7) 
where the fol lowing holds. 

! = 1 

G(z/r 2 )=l-£^z i • • -(8) 

Here r, and r 2 represent constants, e.g., r, = 0.9, r 2 = 0.6. 
Refer to Reference 1, etc., for the details of the weighting 
f i I ter. 

The excitation vector output from the adder 1050 and the 
linear prediction coefficient a s (m) (n) (where j = 1, — , Np, m = 1, 
N sf r ) and the I i near prediction coefficient " a i (m) (n) (where 
j = 1 , ••• , Np, m = 1 , ••• , N s f r ) output from the linear prediction 
coefficient conversion circuit 5030 are input to the weighting 
synthes is filter 5040. 

The weighting synthesis filter 5040 drives the weighting 
synthesis filter for which a l [m) (n), a^ [ml (n) have been set, 
name I y 

H(z)W(z)=Q(z/r 1 )/[A(z)Q(z/r z )] - (9) 
by the above-mentioned excitation vector, whereby a weighted 
reconstructed vector is obtained. 

The transfer function H (Z) = 1 /A (z) of the synthesis filter 
is represented by Equation (10) below. 

l/A(z) = l/(l-^ m) z i ) • • -(10) 

The weighted input vector output from the weighting filter 
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5050 and the weighted reconstructed vector output from the 
weighting synthesis fi Iter 5040 are input tothesubtractor 5060. 
The latter calculates the difference between these vectors and 
outputs the difference to a minimizing circuit 5070 as a 
difference vector. 

The minimizing circuit 5070 successively outputs indices 
corresponding to all sound source vectors that have been stored 
in a sound source signal generating circuit 5110 to the sound 
source signal generating circuit 5110, successively outputs 
indices corresponding to all delays L pd within a range stipulated 
in a pitch signal generating circuit 5210 to the pitch signal 
generating circuit 5210, successively outputs indices 
corresponding to all first gains that have been stored in a first 
gain generating circuit 6220 to the first gain generating 
circuit 6220, and success i ve I y outputs i nd i ces cor respond i ng to 
all second gains that have been stored in a second gain 
generating circuit 6120 to the second gain generating circuit 
6120. 

Further, difference vectors output from the subtractor 
5060 successively enter the minimizing circuit 5070. The 
latter calculates the norms of these vectors, selects a sound 
source vector, a delay L pd , a first gain and a second gain that 
will minimize the norms and outputs indices corresponding to 
these to the code output circuit 6010. The indices output from 
the minimizing circuit 5070 successively enter the pitch signal 
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generating circuit 5210, the sound source signal generating 
circuit 5110, the first gain generating circuit 6220 and the 
second gain generating circuit 6120. 

With the exception of wiring (connections) relating to 
input and output, the pitch signal generating circuit 5210, the 
sound source signal generating circuit 5110, the first gain 
generating circuit 6220 and the second gain generating circuit 
6120 are identical with the pitch signal decoding circuit 1210, 
the sound source signal decoding circuit 1110, the first gain 
decoding circuit 1220 and the second gain decoding circuit 1120 
shown in Fig. 8. Accordingly, these circuits need not be 
explained aga i n. 

The index corresponding to the quantized LSP output from 
the LSP conversion/quantization circuit 5520 is input to the 
code output circuit 6010, and so are the indices, which are 
output from the minimizing circuit 5070, corresponding to the 
sound source vector, the delay L ptf , the f i rst ga i n and the second 
gain. The code output circuit 6010 converts these indices to 
the code of a bit sequence and outputs the code from an output 
terminal 40. 

SUMMARY OF THE DISCLOSURE 

In the course of eager investigations toward the present 
invention, various problems have been encountered. 

A problemwith the conventional coder and decoder described 
above is that there are instances where an abnormal sound is 
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produced in noise segments when the sound source gain (the second 
gain) is smoothed. This is because the sound source gain 
smoothed in the noise segments may take on a value that is much 
larger than the sound source gain before smoothing. 

The reason for this is that since there are cases where the 
sound source gain is smoothed even in a speech segment, it so 
happens that when a sound source gain obtained in the past is 
used to temporally smooth the first-mentioned sound source gain 
in a noise segment, the influence of a gain having a large value 
that corresponds to a past speech segment becomes a factor. 

Accordingly, an object of the present invention in one 
aspect thereof is to provide an apparatus and method, and a 
program product as well as a medium on which the related program 
has been recorded, through which it is possible to avoid the 
occurrence of abnormal sound in noise segments, such sound being 
caused when, in the smoothing of sound source gain (the second 
gain), the sound source gain smoothed in a noise segment takes 
on a value much larger than that of the sound source gain before 
smooth i ng. 

According to a first aspect of the present invention, there 
is provided a speech signal decoding method according to claim 
1. The speech signal decoding method for decoding information 
concerning at least a sound source signal, gain and linear 
prediction coefficients from a received signal, generating an 
excitation signal and linear prediction coefficients from 
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decoded information, and driving a filter, which is constituted 
by the linear prediction coefficients, by the excitation signal 
to thereby decode a speech signal, comprises: a first step of 
smoothing the gain using a past value of the gain; a second step 
5 of limiting the value of the smoothed gain based upon an amount 
of fluctuation calculated from the gain and the smoothed gain; 
and a third step of decoding the speech signal using the gain 
that has been smoothed and limited. 

According to a second aspect of the present invention, 

YQ there is provided a speech signal decoding method for decoding 
information concerning an excitation signal and linear 
prediction coefficients from a received signal, generating an 
excitation signal and linear prediction coefficients from the 
decoded information, and driving a filter, which is constituted 

15 by the linear prediction coefficients, by the exc i tat i on s i gna I 
to thereby decode a speech signal, comprising: a first step of 
deriving a norm of the excitation signal at regular intervals; 
a second step of smoothing the norm using a past value of the 
norm; a third step of limiting the value of the smoothed norm 

20 based upon an amount of fluctuation calculated from the norm and 
the smoothed norm; a fourth step of changing the amplitude of 
the excitation signal in the intervals using the norm and the 
norm that has been smoothed and limited; and a fifth step of 
driving the filter by the excitation signal the amplitude of 

25 which has been changed. 
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According to a third aspect of the present invention, there 
is provided a speech signal decoding method for decoding 
information concerning an excitation signal and linear 
prediction coefficients from a received signal, generating the 
5 excitation signal and the linear prediction coefficients from 
the decoded information, and driving a filter, which is 
constituted by the linear prediction coefficients, by the 
excitation signal to thereby decode a speech signal, comprising 
a first step of identifying a voiced segment and a noise segment 

10 with regard to the received signal using the decoded 

information; a second step of deriving a norm of the excitation 
signal at regular intervals in the noise segment; a third step 
of smoothing the norm using a past value of the norm; a fourth 
step of limiting the value of the smoothed norm based upon an 

15 amount of fluctuation derived from the norm and the smoothed 
norm; a fifth step of changing the amplitude of the excitation 
signal in the intervals using the norm and the norm that has been 
smoothed and limited; and a sixth step of driving the filter by 
the excitation signal the amplitude of which has been changed. 

20 According to a fourth aspect of the present invention, in 

the first aspect of the invention the amount of fluctuation is 
represented by dividing an absolute value of a difference 
between the gain and the smoothed gain by the gain, and the value 
of the smoothed gain is limited in such a manner that the amount 

25 of fluctuation will not exceed a certain threshold value. 
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According to a fifth aspect of the present invention, in 
the second and third aspects of the invention the amount of 
fluctuation is represented by dividing an absolute value of a 
difference between the norm and the smoothed norm by the norm, 
5 and the value of the smoothed norm is limited in such a manner 
that the amount of fluctuation will not exceed a certain 
thresho I d value. 

According to a sixth aspect of the present invention, in 
the second, thirdor fifthaspectof the invention theexcitation 
FO signal in the intervals is divided by the norm in the intervals 
* and the quotient is multiplied by the smoothed norm in the 

intervals to thereby change the amplitude of the excitation 
; s i gna I . 

F; According to a seventh aspect of the present invention, in 

T5 the second or third aspect of the invention switching between 
use of the gain and use of the smoothed gain is performed in 
accordance with an entered switching control signal when the 
speech signal is decoded. 

According to an eighth aspect of the present invention, in 

20 the second, third, fifth or sixth aspect of the invention 

switching between use of the excitation signal and use of the 
excitation signal the amplitude of which has been changed is 
performed in accordance with an entered switchingcontrol signal 
when the speech signal is decoded. 

25 According to a ninth aspect of the present invention, there 
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is provided a speech signal encoding and decoding method 
comprising encoding an input speech signal by expressing it by 
an excitation signal and linear prediction coefficients, and 
performing decoding by the speech signal decoding method 
according to any one of the first to eighth aspects of the 
invent ion. 

According to a tenth aspect of the present invention, there 
is provided a speech signal decoding apparatus for decoding 
information concerning at least a sound source signal, gain and 
linear prediction coefficients from a received signal, 
generating an excitation signal and linear prediction 
coefficients from the decoded information, and driving a filter, 
which is constituted by the linear prediction coefficients, by 
the excitation signal to thereby decode a speech signal, 
comprising: a smoothing circuit smoothing the gain using a past 
value of the gain; and a smooth i ng-quant i ty limiting circuit 
limiting the value of the smoothed gain using an amount of 
fluctuation calculated from the gain and the smoothed gain. 

According to an 11th aspect of the present invention, there 
is provided a speech signal decoding apparatus for decoding 
information concerning an excitation signal and linear 
prediction coefficients from a received signal, generating the 
excitation signal and linear prediction coefficients from the 
decoded info rmati on, and driving a filter, wh i ch is cons t i tu ted 
by the linear prediction coefficients, by the excitation signal 
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to thereby decode a speech signal, comprising: an 
excitation-signal normalizingcircuit calculating a norm of the 
excitation signal at regular intervals and dividing the 
excitation signal by the norm; a smoothing circuit smoothing the 
5 norm using a past value of the norm; a smoo t h i ng-quan t i t y 
limiting circuit limiting the value of the smoothed norm using 
an amount of fluctuation calculated from the norm and the 
smoothed norm; and an excitation-signal reconstruction circuit 
multiplying the smoothed and limited norm by the excitation 

tO signal to thereby change the amplitude of the excitation signal 
in the intervals. 

According to a 12th aspect of the present invention, the 
foregoing object is attained by providing a speech signal 
decoding apparatus for decoding information concerning an 

15 excitation signal and linear prediction coefficients from a 
received signal, generating the excitation signal and linear 
prediction coefficients from the decoded information, and 
driving a filter, which is constituted by the linear prediction 
coefficients, by the excitation signal to thereby decode a 

20 speech signal, comprising a voiced/unvoiced identification 
circuit identifying a voiced segment and a noise segment with 
regard to the received signal using the decoded information; an 
excitation-signal normal izing circuit calculating (deriving) a 
norm of the excitation signal at regular intervals and dividing 

25 the excitation signal by the norm; a smoothing circuit for 
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smoothing the norm using a past value of the norm; a 
smooth i ng-quan t i ty limiting circuit limiting the value of the 
smoothed norm using an amount of fluctuation calculated from the 
norm and the smoothed norm; and an excitation-signal 
5 reconstruction circuit multiplying the smoothed and limited 
norm by the excitation signal to thereby change the amplitude 
of the excitation signal in the intervals. 

According to a 13th aspect of the present invention, in the 
10th aspect of the invention the amount of fluctuation is 

10 represented by dividing an absolute value of a difference 

between the ga i n and the smoothed ga i n by the ga i n, and the value 
of the smoothed gain is limited in such a manner that the amount 
of fluctuation will not exceed a certain threshold value. 

- According to a 14th aspect of the present invention, in the 

15 11th and 12th aspects of the invention the amount of fluctuation 
is represented by dividing the absolute value of the difference 
between the norm and the smoothed norm by the norm, and the value 
of the smoothed norm is limited in such a manner that the amount 
of fluctuation will not exceed a certain threshold value. 

20 According to a 15th aspect of the present invention, in the 

10th or 13th aspect of the invention, the apparatus comprises 
a switching circuit in which switching between use of the gain 
and use of the smoothed gain is performed in accordance with an 
entered switching control signal when the speech signal is 

25 decoded. 
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According to a 16th aspect of the present invention, in the 
11th, 12th or 14th aspect of the invention, the apparatus 
comprises a switching circuit in which switching between use of 
the excitation signal and use of the excitation signal the 
5 amplitude of which has been changed is performed in accordance 
with an entered switching control signal when the speech signal 
is decoded. 

According to an 17th aspect of the present invention, there 
is provided a speech signal encoding and decoding apparatus 

10 comprising: a speech s i gna I encod i ng apparatus encod i ng an input 
speech signal byexpressing it by an excitation si gnal and linear 
prediction coefficients, and a speech signal decoding apparatus 
according to any one of the 10th to 16th aspects of the invention. 
According to an 18th aspect of the present invention, there 

15 is provided a program product, or a medium on which has been 
recorded the program product, for implementing a speech signal 
decoding method for decoding information concerning at least a 
sound source signal, gain and linear prediction coefficients 
from a received signal, generating the excitation signal and the 

20 linear prediction coefficients from the decoded information, 
and driving a filter, which is constituted by the linear 
prediction coefficients, by the excitation signal to thereby 
decode a speech signal, wherein the program causes a computer 
to execute processing which includes smoothing the gain using 

25 a past va I ue of the ga i n ; I i m i t i ng the va I ue of the smoothed ga i n 
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based upon an amount of fluctuation calculated from the gain and 
the smoothed gain; and decoding the speech signal using the gain 
that has been smoothed and limited. 

According to an 19th aspect of the present invention, there 
5 is provided a program product for implementing a speech signal 
decoding method for decoding information concerning an 
excitation signal and linear prediction coefficients from a 
received signal, generating an excitation signal and linear 
prediction coefficients from the decoded information, and 

10 driving a filter, which is constituted by the linear prediction 
coefficients, by the excitation signal to thereby decode a 
speech signal. The program product causes a computer to execute 
processing which includes: (a) calculating a norm of an 
excitation signal at regular intervals and smoothing the norm 

15 using a past value of the norm;(b) limiting the value of the 
smoothed norm; based upon an amount of fluctuation calculated 
from the norm and the smoothed norm; and (c) changing the 
amplitude of the excitation signal in the intervals using the 
norm and the norm that has been smoothed and I imited; and driving 

20 the filter by the excitation signal the amplitude of which has 
been changed. 

According to an 20th aspect of the present invention, there 
is provided a program product for implementing a speech signal 
decoding method for decoding information concerning an 
25 excitation signal and linear prediction coefficients from a 
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received signal, generating an excitation signal and linear 
prediction coefficients from the decoded information, and 
driving a filter, which is constituted by the linear prediction 
coefficients, by the excitation signal to thereby decode a 
5 speech signal. The program product causes a computer to execute 
processing which includes: (a) identifying a voiced segment and 
a noise segment with regard to a received signal using decoded 
i nf ormat i on ; (b) calculating a norm of an excitation signal at 
regular intervals in the noise segment and smoothing the norm 

%Q using a past value of the norm;(c) limiting the value of the 
smoothed norm using an amount of fluctuation calculated from the 
norm and the smoothed norm; and (d)changingtheamplitudeof the 
excitation signal in the intervals using the norm and the norm 

: that has been smoothed and limited; and driving the filter by 

15 the excitation signal the amplitude of which has been changed. 

According to a 21st aspect of the present invention, in the 
18th aspect of the invention there is provided a program product 
which includes representing the amount of fluctuation by 
dividing an absolute value of a difference between the gain and 

20 the smoothed gain by the gain, and limiting the value of the 
smoothed gain in such a manner that the amount of fluctuation 
will not exceed a certain threshold value. 

According to a 22nd aspect of the present invention, in the 
19th or 20th aspect of the invention there is provided a program 

25 product which includes representing the amount of fluctuation 
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by dividing the absolute value of the difference between the norm 
and the smoothed norm by the norm, and limiting the value of the 
smoothed norm in such a manner that the amount of fluctuation 
will not exceed a certain threshold value. 
5 According to a 23rd aspect of the present invention, in the 

19th, 20th or 22nd aspect of the invention there is provided a 
program product which includes dividing the excitation signal 
in the intervals by the norm in the intervals and multiplying 
'*] the quotient by the smoothed norm in the intervals to thereby 
10 change the amplitude of the excitation signal. 
l> According to a 24th aspect of the present invention, in the 

»4 18th or 21st aspect of the invention there is provided a program 
j_ product which includes switching between use of the gain and use 
j the smoothed gain in accordance with an entered switching 
15 control signal when the speech signal is decoded. 

According to a 25th aspect of the present invention, in the 
19th, 20th, 22nd and 23rd aspect of the invention there is 
provided a program product which includes switching between use 
of the excitation signal and use of the excitation signal the 
20 amp I itude of which has been changed in accordance with an entered 
switching control signal when the speech signal is decoded. 

According to a 26th aspect of the present invention, there 
is provided a program product which includes encoding an input 
speech signal by expressing it by an excitation signal and I inear 
25 prediction coefficients, and performing decoding by the speech 
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signal decoding method according to any one of the first, to 
eighth aspects of the invention. 

According to a further aspect the program product may be 
carried by a suitable medium which includes dynamic and/or 
5 static medium, such as a recording medium, and/or carrier wave 
etc. 

Other aspects are disclosed in the claims 27 et seq, which 
are incorporated herein by reference thereto. 

Other objects, features and advantages of the present 
10 invention will be apparent to those skilled in the art from the 
following description taken in conjunction with the 
accompanying drawings, in which like reference characters 
designate the same or similar parts throughout the figures 
thereof. 

15 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram illustrating the construction of 
a speech signal decoding apparatus according to a first 
embodiment of the present invention; 

Fig. 2 is a block diagram illustrating the construction of 
20 a speech signal decoding apparatus according to a second 
embodiment of the present invention; 

Fig. 3 is a block diagram illustrating the construction of 
a speech signal decoding apparatus according to a third 
embodiment of the present invention; 
25 Fig. 4 is a block diagram illustrating the construction of 
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a speech signal decoding apparatus according to a fourth 
embodiment of the present invention; 

Fig. 5 is a block diagram illustrating the construction of 
a speech signal decoding apparatus according to a fifth 
5 embodiment of the present invention; 

Fig. 6 is a block diagram illustrating the construction of 
a speech signal decoding apparatus according to a sixth 
embodiment of the present invention; 

Fig. 7 is a block diagram illustrating the construction of 
■tO a speech signal decoding apparatus according to an embodiment 
: of the present invention; 

Fig. 8 is a block diagram illustrating the construction of 
a speech signal decoding apparatus according to the prior art; 
; and 

is Fig. 9 is a block diagram illustrating the construction of 

7j a speech signal encoding apparatus according to the prior art. 
PREFERRED EMBODIMENTS OF THE INVENTION 

Preferred modes of practicing the present invention will 
now be described. 

20 In the present invention, asmoothingcircuit (1320 in Fig. 

1) smoothes sound source gain (second gain) in a noise segment 
using sound source gain obtained in the past, and a 
s mo othing- quantity limiting circuit (7200 in Fig. 1) obtains the 
amount of fluctuation between the sound source gain (second 

25 gain) and the sound sou rce ga i n smoothed by the smooth i ng c i rcu i t 
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(1320 in Fig. 1) and limits the value of the smoothed gain in 
such a manner that the amount of fluctuation will not exceed a 
certain threshold value. Thus, the values that can be taken on 
by the smoothed sound source gain are limited based upon an 
amount of fluctuation calculated using a difference between the 
smoothed sound source gain and the sound source gain in such a 
manner that the sound source gain smoothed in the noise segment 
will not take on a value that is very large in comparison with 
the sound source gain before smoothing. As a result, the 
occurrence of abnormal sound in the noise segment is avoided. 

In a first preferred mode of the present invention, as shown 
in Fig. 1, a speech signal decoding apparatus is for decoding 
information concerning at least a sound source signal, gain and 
linear prediction (LP) coefficients from a received signal, 
generating an excitation signal and linear prediction 
coefficients from the decoded information, and drivinga filter, 
which is constituted by the linear prediction coefficients, by 
the excitation signal to thereby decode a speech signal, and the 
apparatus includes a smoothing circuit (1320) for smoothing the 
gain using a past value of the gain, and smoo thing-quantity 
limiting circuit (7200) for limiting the value of the smoothed 
gain using an amount of fluctuation calculated from the gain and 
the smoothed gain. The smooth i ng-quan t i ty limiting circuit 
(7200) obtains the amount of fluctuation by dividing the 
absolute value of the difference between sound source gain 
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(second gain) and the smoothed sound source gain by the sound 
source gain. 

More specifically, the apparatus includes: a code input 
circuit (1010) for splitting code of the a bit sequence of an 
5 encoded input signal that enters from an input terminal, 

converting the code to indices that correspond to a plurality 
of decode parameters, outputting an index corresponding to a 
line spectrum pair (LSP), which represents frequency 
characteristic of the input signal, to an LSP decoding circuit, 

10 outputting an index corresponding to a delay that represents the 
pitch period of the input signal to a pitch signal decoding 
circuit, outputting an index corresponding to a sound source 
vector comprising a random number or a pulse train to a sound 
source signal decoding circuit, outputting an index 

T~5 corresponding to a first gain to a first gain decoding circuit, 
and outputting an index corresponding to a second gain to a 
second gain decoding circuit; the LSP decoding circuit (1020), 
to which the index output from the code input circuit (1010) is 
input, for reading the LSP corresponding to the input index out 

20 of a table which stores LSPs corresponding to indices, obtains 
an LSP in a subframe of the present frame (the nth frame), and 
outputs the LSP; the linear prediction coefficient conversion 
circuit (1030), to which the LSP output from the LSP decoding 
circuit is input, for converting the LSP to linear prediction 

25 coefficients and outputting the coefficients to a synthesis 
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filter; the sound source signal decoding circuit (1110), to 
which the index output from the code input circuit (1010) is 
input, for reading a sound source vector corresponding to the 
index out of a table which stores sound source vectors 
corresponding to indices, and outputting the sound source vector 
to a second gain decoding circuit; the second gain decoding 
circuit (1120), to which the index output from the code input 
circuit (1010) is input, for reading a second ga in corresponding 
to the input index out of a table which stores second gains 
corresponding to indices, and outputting the second gain to a 
smoothing circuit; the second gain circuit (1130), to which a 
first sound source vector output from the sound source signal 
decoding circuit (1110) and the second gain are input, for 
multiplying the first sound source vector by the second gain to 
generate a second sound source vector and outputting the 
generated second sound source vector to the adder (1 050) ; the 
memory circuit (1240) for holding an excitation vector input 
thereto from the adder (1 050) and outputting a held excitation 
vector, which was input thereto in the past, to the pitch signal 
decoding circuit (1210); the pitch signal decoding circuit 
(1210), to which the past excitation vector held by the memory 
circuit (1 240) and the index (which specifies a delay L pd ) output 
from the code input circuit (1010) are input, for cutting vectors 
of samples corresponding to the vector length from a point L pd 
samples previous to the starting point of the present frame, 
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generating a first pitch vector and outputting the first pitch 
vector to the first gain circuit (1 230) ; the first gain decoding 
circuit (1220), to which the index output from the code input 
circuit (1010) is input, for reading a first gain corresponding 
5 to the input index out of a table and outputting the first gain 
to a f i rs t ga i n c i rcu i t ; the f i r s t ga i n c i r cu i t (1 230), to which 
the first pitch vector output from the pitch signal decoding 
circuit (1210) and the first gain output from the first gain 
decoding circuit (1220) are input, for multiplying the input 

10 first pitch vector by the first gain to generate a second pitch 
vector and outputting the generated second pitch vector to the 
adder; the adder (1050), to wh i ch the second p i tch vector output 
from the first gain circuit (1230) and the second sound source 
vector output from the second gain circuit (1130) are input, for 

t.5 calculating the sum of these inputs and outputting the sum to 
the synthesis filter (1040) as an excitation vector; the 
smoothing coefficient calculation circuit (1310), to which LSP 
output from the LSP decoding circuit (1020) is input, for 
calculating average LSP in an nth frame, finding the amount of 

20 fluctuation of the LSP with respect to each subframe, finding 
a smoothing coefficient in the subframe and outputting the 
smoothing coefficient to a smoothing circuit; the smoothing 
circuit (1320), to which the smoothing coefficient output from 
the smoothing coefficient calculation circuit (1310) and the 

25 second gain output from the second gain decoding circuit are 
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input, for finding the average gain from the second gain in the 
subframe and outputting the second gain; the synthesis filter 
(1040), to which the excitation vector output from the adder 
(1050) and the linear prediction coefficients output from the 
5 linear prediction coefficient conversion circuit (1030) are 
input, for driving a synthesis filter, for which the linear 
prediction coefficients have been set, by the excitation vector 
to thereby calculate a reconstructed vector, and outputting the 
reconstructed vector from an output terminal; and the 

\Q smoothing- quantity limiting circuit (7200), to which the second 
gain output from the second gain decoding circuit (1120) and the 
smoothed second gain output from the smoothing circuit (1320) 
are input, for finding the amount of fluctuation between the 
smoothed second gain output from the smoothing circuit (1320) 

IB and the second gain output from the second gain decoding circuit 
(1120), using the smoothed second gain as is when the amount of 
fluctuation is less than a predetermined threshold value, 
replacing the smoothed second gain with a smoothed second gain 
limited in terms of the values it is capable of taking on when 

20 the amount of fluctuation is equal to or greater than the 
threshold value, and outputting this smoothed second gain to the 
second ga i n circuit (1130). 

In a second preferred mode of the present invention, as 
shown in Fig. 2, a speech signal decoding apparatus is for 

25 decoding information concerning an excitation signal and linear 
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prediction coefficients from a received signal, generating an 
excitation signal and linear prediction coefficients from the 
decoded information, and driving a filter, which is constituted 
by the linear prediction coefficients, by the excitation signal 
5 to thereby decode a speech signal. Particularly, the apparatus 
includes an excitation-signal normalizing circuit (2510) for 
deriving a norm of the excitation signal at regular intervals 
and dividing the excitation signal by the norm; a smoothing 
circuit (1 320) for smoothing the norm using a past value of the 

10 norm; a smooth i ng-quan t i ty I i m i t i ng c i rcu i t (7200) for limiting 
the value of the smoothed norm using an amount of fluctuation 
calculated from the norm and the smoothed norm; and an 
excitation-signal reconstruction circuit (2610) for 
multiplying the smoothed and limited norm by the excitation 

15 signal to thereby change the amplitude of the excitation signal 
i n the i nterva I s. 

More specifically, the apparatus includes: an 
excitation-signal normalizing circuit (2510), to which an 
excitation vector in a subframe output from the adder (1050) is 

20 input, for calculating gain and a shape vector from the 

excitation vector every subframe or every sub-subframe obtained 
by subdividing a subframe, outputting the gain to the smoothing 
circuit (1320) and outputting the shape vector to an 
excitation-signal reconstruction circuit (2610); and the 

25 excitation-signal reconstruction circuit (2610), to which the 
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gain output from the smoothing- quantity limiting circuit (7200) 
and the shape vector output from the excitation-signal 
normalizing circuit (2 510) areinput, for calculating a smoothed 
excitation vector and outputting this excitation vector to the 
5 memory circuit (1240) and synthesis filter (1040). In this 
apparatus, the smooth i ng-quan t i ty limiting circuit (7200) has 
the output of the smoothing circuit (1 320) applied to one input 
terminal thereof and has the output of the excitation-signal 
norma I i z i ng c i rcu i t (2510), rather than the output of the second 

10 gain decoding circuit (1120) as in the first mode, applied to 
the other input terminal thereof, finds the amount of 
fluctuation between the smoothed gain output from the smoothing 
circuit (1320) and the gain output from the excitation-signal 
normalizing circuit (2510), uses the smoothed gain as is when 

1=5 the amount of fluctuation is less than a predetermined threshold 

~ value, replaces the smoothed gain with a smoothed gain limited 
in terms of values it is capable of taking on when the amount 
of fluctuation is equal to or greater than the threshold value, 
and supplies this smoothed gain to the excitation-signal 

20 reconstruction circuit (2610); the output of the second gain 
decoding circuit (1120) is input to the second gain circuit 
(1130) as second gain; and the smoothing circuit (1320) has the 
output of the excitation-signal normalizing circuit (2510), 
rather than the output of the second gain decoding circuit (1120) 

25 as in the first mode, applied thereto, as well as the output of 
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the smoothing coefficient calculation circuit (1310). 

In a third preferred mode of the present invention, as shown 
in Fig. 3, a speech signal decoding apparatus is for decoding 
information concerning an excitation signal and linear 
prediction coefficients from a received signal, generating an 
excitation signal and linear prediction coefficients from the 
decoded information, and driving a filter, which is constituted 
by the linear prediction coefficients, by the excitation signal 
to thereby decode a speech signal, and the apparatus includes: 
avoiced/unvoiced identification circuit (2020) f o r i den t i f y i ng 
a voiced segment and a noise segment with regard to the received 
signal using the decoded information; the excitation-signal 
normalizing circuit (2510) for calculating a norm of the 
excitation signal at regular intervals and dividing the 
excitationsignal by the norm; the smoothing circuit (1320) for 
smoothing the norm using a past value of the norm; the 
smooth i ng-quant i ty limiting circuit (7200) for limiting the 
value of the smoothed norm using an amount of fluctuation 
calculated from the norm and the smoothed norm; and an 
excitation-signal reconstruction circuit (2610) for 
multiplying the smoothed and limited norm by the excitation 
signal to thereby change the amplitude of the excitation signal 
in the intervals. 

More specifically, the apparatus includes: a power 
calculation circuit (3040), to which the reconstructed vector 
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output from the synthesis filter (1040) is input, for 
calculating the sum of the squares of the reconstructed vector 
and outputting the power to a voiced/unvoiced identification 
circuit; a speech mode decision circuit (3050), to which a past 
excitation vector held by the memory circuit (1 240) and an index 
specifying a delay output from the code input circu it (1010) are 
input, for calculating a pitch prediction gain in a subframe from 
the past excitation vector and delay, determining a 
predetermined threshold value with respect to the pitch 
prediction gain or with respect to an in-frame average value of 
the pitch prediction gain in a certain frame, and setting a 
speech mode; the voiced/unvoicedidentification circuit (2020), 
to which an LSP output from the LSP decoding ci rcui t (1 020), the 
speech mode output from the speech mode decision circuit (3050) 
and the power output from the power calculation circuit (3040) 
are input, for finding the amount of fluctuation of a spectrum 
parameter and identifying a voice segment and an unvoiced 
segment based upon the amount of fluctuation; a noise 
c I ass i f i cat i on c i rcu i t (2030), to wh i ch amount-of-f I uctuat i on 
information) and an identification flag output from the 
voiced/unvoiced identification circuit (2020) are input, for 
classifying noise; and a first changeover circuit (2110), to 
which the gain output from an excitation-signal normalizing 
circuit (2510), an identification flag output from the 
voiced/unvoiced identification circuit (2020) and a 
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classification flag output from the noise classification 
circuit (2030) are input, for changing over a switch in 
accordance with a value of the identification flag and a value 
of the classification flag to thereby switchingly output the 
5 gain to any one of a plurality of filters (2150, 2160, 2170) 
having different filter characteristics from one another; 
wherein the filter selected from among the plurality of filters 
(21 50, 21 60, 21 70) has the gain output from the first changeover 
circuit (2110) applied thereto, smoothes the gain using a linear 

10 fi Iter or non-l inear fi Iter and outputs the smoothed gain to the 
smooth i ng-quan t i ty limiting circuit (7200) as a first smoothed 
gain; and the smoo t h i ng-quan t i t y limiting circuit (7200) has the 
first smoothed gain output from the selected filter applied to 
one input terminal thereof, has the output of the 

15 excitation-signal normalizing circuit (2510) applied to the 
other input terminal thereof, finds the amount of fluctuation 
between the gain output from the excitation-signal normalizing 
circuit (2510) and the first smoothed gain output from the 
selected filter, uses the first smoothed gain as is when the 

20 amount of fluctuation is less than a predetermined threshold 
value, replaces the first smoothed gain with a smoothed gain 
limited in terms of values it is capable of taking on when the 
amount of fluctuation is equal to or greater than the threshold 
value, and supplies this smoothed gain to the excitation-signal 

25 reconstruction circuit (2610). 
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In a preferred mode of the present invention, as shown in 
Fig. 4, switching between use of the gain and use of the smoothed 
gain may be performed by a changeover circuit (7110) in 
accordance with an entered switching control signal when the 
5 speech signal is decoded. 

In a preferred mode of the present invention, as shown in 
Fig. 5 or 6, the apparatus further includes a second changeover 
circuit (7110), to which the excitation vector output from the 
adder (1050) is input, for outputting the excitation vector to 
10 the synthesis filter (1040) or to the excitation-signal 

normalizing circuit (2510) in accordance with a changeover 
control signal, which has entered from an input terminal (50), 
when the speech signal is decoded. 

Embodiments of the present invention will now be described 
15 with reference to the drawings in order to explain further the 
modes of the invention set forth above. 

Fig. 1 is a block diagram illustrating the construction of 
a speech signal decoding apparatus according to a first 
embodiment of the present invention. Components in Fig. 1 
20 identical with or equivalent to those shown in Fig. 8 are 
identified by like reference characters. 

In Fig. 1, the input terminal 10, output terminal 20, code 
i nput c i rcu i t 1 01 0, LSP decod i ng c i r cu i t 1 020, I i near pred i ct ion 
coefficient conversion circuit 1030, sound source signal 
25 decoding circuit 1110, memory circuit 1240, pitch signal 
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decod i ng c i rcu i t 1 21 0, first gain decodingcircuit 1220, second 
gain decoding circuit 1120, f i rs t ga i n c i rcu i t 1 230, second gain 
circuit 1130, adder 1050, smoothing coefficient calculation 
circuit 1310, smoothing circuit 1320 and synthesis filter 1 040 
5 are identical with the similarly identified components shown in 
Fig. 8 and need not be described again. The entire description 
made in the introductory part of this application with respect 
to Fig. 8 is hereby incorporated as part of the disclosure of the 
present invention, as far as it relates to the present invention, 

10 too. Pr i mar i I y, on I y components that differ from those shown in 
Fig. 8 will be described below. 

In the first embodiment of the present invention 
illustrated in Fig. 1, the s mo othing- quantity limiting circuit 
7200 has been added onto the arrangement of Fig. 8. As in the 

15 arrangement of Fig. 8, in the first embodiment of the invention 
it is assumed that the input of the bit sequence occurs in T fr 
msec (e.g., 20 ms) and that computation of the reconstructed 
vector is performed in a period (subframe) ofT fr /N sfr msec (e. g. , 
5 ms) , where N sfr is an integer (e.g., 4). Let frame length be 

20 L fr samples (e.g., 320 samples) and let subframe length be L sfr 
samples (e.g., 80 samples). The numbers of these samples is 
decided by the sampling frequency (e.g., 16 kHz) of the input 
s i gna I . 

The second gain (represented by g 2 ) output from the second 
25 gain decoding circuit 1120 and the smoothed second gain 



40 



(represented by ~ g 2 ) output from the smoothing circuit 1320 are 
input to the smooth i ng-quan t i ty limiting circuit 7200. 

The second gain ~~ g 2 output from the smoothing circuit 1320 
is limited in terms of the values it can take on in such a manner 
5 that it will not become abnormally large or abnormally small in 
comparison with the second gain g 2 output from the second gain 
decod i ng circuit 1120. 

First, let amount d g2 of fluctuation of g 2 be 
representedby 
10 d g2 = |~~ g 2 -g 2 |/g 2 -(11) 

When the fluctuation amount d g2 is less than a certain 

threshold value C g2 , is used as is. When the fluctuation amount 
d g2 is equal to or greater than the threshold value C g2 . is 
limited. That is, g 2 is replaced using the following 
15 criterion: 

if (d g 2 <C g 2 ) then ~~ g 2 = — g 2 
else if ( — g 2 -g 2 >0 ) then — g 2 = ( 1 + C g z ) • g 2 
else — g 2 = ( 1 - C„ 2 ) • g 2 
In other words, 
20 if d g2 <C g2 is true, then ~~ g 2 is used as is; 

if d g2 <C g2 is false (i.e., if d g 2 ^ C g 2 ho I ds) , then a 
substitution is made for as follows: 

— g 2 =(1+C g2 ) -g 2 wn en ~~ g 2 -g 2 >0 holds true; and 
~~ g z = (1-C g2 ) -g 2 when — g 2 -g 2 ^0 holds true. 

25 Here it is assumed that C g2 =0. 90 holds. 

Finally, the smooth i ng-quan t i ty limiting circuit 7200 
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outputs the substitute ~~ g 2 to the second gain circuit 1130. 

A second embodiment of the present invention will now be 
descr i bed. 

Fig. 2 is a block diagram illustrating the construction of 
5 a speech signal decoding apparatus according to a second 
embodiment of the present invention. Components in Fig. 2 
identical with or equivalent to those shown in Figs. 1 and 8 are 
identified by like reference characters. 

As shown in Fig. 2, the second embodiment is so adapted that 

10 the norm of the excitation vector is smoothed instead of the 
decoded sound source gain (the second gain] as in the first 
embodiment. It should be noted that the input terminal 10, 
output terminal 20, code input circuit 1010, LSP decoding 
circuit 1020, linear prediction coefficient conversion circuit 

1 5 1 030, sound sou rce s i gna I decod i ng c i rcu i t 1 1 1 0, memo r y c i r cu i t 
1240, pitch signal decoding circuit 1210, first gain decoding 
circuit 1220, second gain decoding circuit 1120, first gain 
circuit 1230, second gain circuit 1130, adder 1050, smoothing 
coefficient calculation circuit 1310, smoothing circuit 1320 

20 and synthesis filter 1040 are identical with the similarly 
identified components shown in Fig. 8 and need not be described 
aga i n. 

As shown in Fig. 2, the second embodiment of the invention 
additionally provides the arrangement of the first embodiment 
25 illustrated in Fig. 1 with the excitation-signal normalizing 
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circuit 2510, the input towhich is the output of the adder 1050, 
and with the excitation-signal reconstruction circuit 2610, the 
inputs to which are the outputs of the excitation-signal 
normalizing circuit 2510 and smooth i ng-quan t i ty limiting 
5 circuit 7200 and the output of which is delivered to synthesis 
filter 1040 and memory circuit 1240. 

The output of the smoothing circuit 1 320 and the output of 
the excitation-signal normalizing circuit 2510 are input to the 
smooth i ng-quan t i ty limiting circuit 7200, which supplies its 

10 output to the excitation-signal reconstruction circuit 2610. 
In other aspects this embodiment is similar to the first 
embodiment except for the signal connections. 

The excitation-signal normalizing circuit 2510 and 
excitation-signal reconstruction circuit 2610 will now be 

I 5 descr i bed. 

An excitation vector X exc (ni) (i) (where i = 0, L sfr -1, 
m = 0, N sfr -1) in an m t h subsample output from the adder 1050 
is input to the excitation-signal normalizing circuit 2510. 
The latter calculates gain and a shape vector from the excitation 
20 vec t o r X e x c ( m 1 (i) every subf rame or every sub-subf rame obta i ned 
by subdividing a subframe, outputs the gain to the smoothing 
circuit 1320 and outputs the shape vector to the excitation- 
signal reconstruction circuit 2610. A norm represented by 
Equation (12) below is used as the gain. 
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/L s& /N ssfr -i r 

gexc(m-N ssfr +1)- £ x^(l--^ + n) 2 , 



V n=0 1X1 ssfr 

m = 0,-"-,N sfr - l = 0,---,N ssfr -1 



■ " "(12) 



where N s s f r represents the number of subd i v i s i ons (the number of 
sub-subf rames) of a subframe (e.g.,N ssfr = 2). The 
excitation-signal normalizing circuit 2510 calculates the shape 
vector, which is obtained by dividing the excitation vector 
X exc (B) (i) by gain g exc (j) (where j = 0, ... N s s f r • N s f r - 1 ) , in 
accordance with Equation (13) below. 



Thegaing exc (j) (where j=0, •••N ssfr -N sfr -1) output from 
the smoothing circuit and a shape vector s 6XC (jl (i) output from 
the excitation-signal normalizing circuit 2510 are input to the 
excitation-signal reconstruction circuit 2610. The latter 
calculates a (smoothed) excitation vector ~X exc lm) (i) in 
accordance with Equation (14) below and outputs the excitation 
vector to the memory circuit 1240 and synthesis filter 1040. 



A third embodiment of the present invention will now be 
descr i bed. 

Fig. 3 is a block diagram illustrating the construction of 




■ " "(13) 



*i£ a • + i) = gexc (m • N ssfr + 1) • s£ N - +1) (i), 

i = O,- ■ -,L sft /N ssfr -1,1 = 0,---, N ssfr - l,m = O,- - N ssfr - 1 



■ ■ -(14) 
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a speech signal decoding apparatus according to a second 
embodiment of the present invention. Components in Fig. 3 
identical with or equivalent to those shown in Figs. 2 and 8 are 
identified by I ike reference characters. The input terminal 10, 
5 output terminal 20, code input circuit 1010, LSP decoding 
circuit 1020, linear prediction coefficient conversion circuit 
103 0, sound source signal decoding circuit 1110, memory circuit 
1240, pitch signal decoding circuit 1210, first gain decoding 
circuit 1220, second gain decoding circuit 1120, first gain 

10 circuit 1230, second gain circuit 1130, adder 1050, smoothing 
coefficient calculation circuit 1310, smoothing circuit 1320 
and synthesis filter 1040 are identical with the similarly 
i den t i f i ed componen ts shown i n F i g. 8, and the excitation-signal 
normalizing circuit 2510 and excitation-signal reconstruction 
-15 circuit 2610 are identical with those shown in Fig. 2. 

Accordingly, these components need not be described again. 
Further, the smoo th i ng-qu an t i t y limiting circuit 7200 is 
similar to that of the first embodiment except for a difference 
in the connections. 

20 As shown in Fig. 3, the third embodiment of the invention 

additionally provides the arrangement of the second embodiment 
illustrated in Fig. 2 with the power calculation circuit 3040, 
speech mode decision circuit 3050, voiced/unvoiced 
identification circuit 2020, noise classification circuit 2030, 

25 first changeover circuit 2110, a first filter 2150, a second 
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filter 2160 and a third filter 2170. How this embodiment 
differs from the second embodiment will now be described. 

The reconstructed vector output from the synthesis filter 
1040 is input to the power calculation circuit 3040. The latter 
calculates the sum of the squares of the reconstructed vector 
and outputs the power to a voiced/unvoiced identification 
circuit 2020. Here the power calculation circuit 3040 
calculates power every subframe and uses the reconstructed 
vector output from the synthesis filter 1 040 in an (in— 1 ) t h 
subframe in the calculation of power in an rath subframe. 
Letting the reconstructed vector be represented S s y n (i), i=0, 
•••,L sfr , power E pow is calculated in accordance with Equation 
(15) below. 



It is also possible to use the norm of the reconstructed 
vector represented by Equation (16) below instead of Equation 
(15) . 



A past excitation vector e mein (i), i =0, ■•■ , L B e „ -1 held by the 
memory circuit (1 240) and the index output from the code input 
circuit 1010 are input to the speech mode decision circuit 3050. 
The index specifies a delay L pd . Here L mem represents a constant 
decided by the maximum value of L pd . The speech mode decision 



e pow =7^-10) 



■■■(15) 




■ ■ -(16) 



46 



circuit 3050 calculates a pitch prediction gain G enieni (m) , m=0, 1, 
■•■ , N sf r in the m t h subframe from a past excitation vector e mera (i) 
and the delay L pd . 
G eraem On) =10- log, „ (g emem (m)) - (17) 
5 where 

t - w "; — 

E al (m)E a2 (m) 
E al (m)=l4(i) 

i=0 

Ea 2 (m) = L £eL(i-L pd ) 

i=0 

E c (m)= £e^ m (i)e mem (i-L pd ) ■ ■ -(18) 

1=0 

The speech mode decision circuit 3050 executes the 

following threshold-value processing with respect to the pitch 

prediction gain G emem (m) or with respect to an in-frame average 

10 value of the pitch prediction gain G eraeni (m) in the nth frame, 

thereby setting a speech mode S„, ode : 

if ( _ G emem (n) ^3. 5) then S raode = 2 
else S B 0 „ e =0 

That is, if — G e B e „ (n) ^ 3. 5 holds, then the S mode is 2; 
15 otherw i se, the S a 0 d e i s 0. 

The speech mode decision circuit 3050 outputs the speech 
mode S raode to the voiced/unvoiced identification circuit 2020. 

LSPq " j (m) (n) output f rom the LSP decod i ng c i rcu i t 1 020, the 
speech mode S mode output from the speech mode decision circuit 
20 3050 and the power E pow output from the power calculation circuit 
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3040 are input to the voiced/unvoiced identification circuit 
2020. A procedure for obtaining the amount of fluctuation of 
a spectrum parameter is indicated below. Here LSP q", {m] (n) is 
used as the spectrum parameter. The voiced/unvoiced 
5 identification circuit 2020 calculates a long-term average q 
— j (n) in a (n) frame in accordance with Equation (19) below. 

q } (n) - P 0 • qj (n - 1) + (1 - P„) ■ ^ j = U" " %N p ■ ■ -(19) 

where /3 0 =0.9 Amount d Q (n) of deviation (fluctuation) of LSP 
in the nth frame is defined by Equation (20) below. 

D (m) (n) 

'o ^-EL^cST - (20) 

where D (m) aj (n) corresponds to the distance between (n) and 
V"', (n) . For example, Equations (21a) and (21b) below are 
used. 

Dj> (n) = (q } (n) - q< m) (n)) 2 ■ ■ -(21a) 

Dj>(n) = |q j (n)-qS m) (n)| ■ ■ -(21b) 

15 In this embodiment, the absolute value of Equation (21b) 

i s used as the d i stance. 

Approximate correspondence can be established between an 
interval where the fluctuation d q (n) is large and a voiced 
segment and between an interval where the fluctuation d Q (n) is 
20 small and an unvoiced (noise) segment. 

However, the amount of fluctuation d„ (n) varies greatly 
with time and the range of values of d p (n) in a voiced segment 
and the range of values of d„ (n) in an unvoiced segment overlap 
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each other. A problem which arises is that it is not easy to 
set a threshold value for distinguishing between voiced and 
unvo i ced segments. Accordingly, the I ong-term average of d p (n) 
is used in the identification of the voiced and unvoiced 
5 segments. 

The long-term average of d pl (n) is found using a linear 
or non-linear filter. By way of example, the mean, median or 
mode of d q (n) can be employed as d — a1 (n) . Here Equation (22) 
is used. 

10 d ql (n) - P ■ d ql (n - 1) + (1 - P , ) • d q (n) - - -(22) 

where )8 n =0. 9 hoi ds. 

An identification flag S vs is decided by applying 
threshold-value processing to (~~ d ql (n)^C th1 ) then S v s = 1 
else S v s = 0 

15 That is, if — d q1 (n)^C thl holds, S v s is 1; otherwise, 

S vs =0 holds. 

Here C t h , represents a certain constant (e.g., 2.2), and 
S vs =1 corresponds to a voiced segment and S ¥S =0 to an unvoiced 
segmen t. 

20 Since d q (n) is small in an interval where there is a high 

degree of steadiness, even in a voiced segment, the voiced 
segment may be mistaken for an unvoiced segment. Accordingly, 
in a case where the power of a frame is high and the pitch 
prediction gain is high, the segment is regarded as being a 

25 voiced segment. When S vs =0holds, S vs is revised in accordance 
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with the following criterion: 

if ("E ris ^C ril and S mode ^2) then S vs =1 
else S vs = 0 

That is, if ~E rms ^C rnis and S mode ^2 hold, S v s is 1; 
5 otherwise, S vs is 0. 

Here C rms (where rms stands for the root-mean-square value) 
represents a certain constant (e.g., 1 0,000). The relation 
S node ^2 corresponds to a case where the in-frame average value 
of pitch prediction gain is equal to or greater than 3. 5 dB. The 
10 voiced/unvoiced identification circuit 2020 outputs S vs to the 
noise classification circuit 2030 and first changeover circuit 
2110 and outputs to the noise classification circuit 2030. 

The inputs to the noise classification circuit 2030 are 
d a1 (n) and S v s output from the voiced/unvoiced identification 
15 circuit 2020. The noise classification circuit 2030 obtains a 
value , which reflects the average behavior of d a1 (n) , in an 
unvo i ced segmen t (no i s e s egmen t) by us i ng a I i near or non- I i near 
filter. The noise classification circuit 2030 calculates d 
a2 (n) in accordance with Equation (23) below when S vs =0 holds: 
20 d q2 (n) = P • d q2 (n - 1) + (1 - P 2 ) • d ql (n) ■ ■ -(23) 

where jS 2 =0.94 holds. The noise classification circuit 2030 
classifies noise by applying threshold-value processing to 
d q2 (n) an d decides a classification flag S nx . 
if (d~~ a2 (n) ^C th2 and S mode ^2) then S n x = 1 
25 else S n x = 0 
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That is, d — q 2 (n) ^C t „ 2 then S raode ^2 hold, the 
classification fiagS nx is 1 , otherwise, the c I ass i f i cat i on f I ag 
S„x is 0. 

Here C t h 2 represents a certain constant (1.7), S nx =1 
5 corresponds to noise in which the temporal change of the 

frequency characteristic is non-steady and S nx =0 corresponds to 
noise in which the temporal change of the frequency 
characteristic is steady. The noise classification circuit 
2030 outputs S nx to the first changeover circuit 2110. 

10 The gain g exc (j) (where j = 0, j=0, N ssfr - N sfr -1) output 

from the excitation-signal normalizing circuit 2510, the 
identification flag S v s output from the voiced/unvoiced 
identification circuit 2020 and the classification flag S n x 
output from the noise classification circuit 2030 are input to 

15 the first changeover circuit 2110. The latter changes over a 
switch in accordance with the value of the identification flag 
and the value of the classification flag, thereby outputting the 
gain G exc (j) to the first f i Iter 2150 when S v s =0 and S n x =0 hold, 
to the second filter 2160 when S vs =0 and S nx =1 hold and to the 

20 third filter 2170 when S vs =1 holds. 

The gain g exc (j) (where j = 0, ■•■,N ssfr -N sfr -1) output from 
the first changeover circuit 2110 is input to the first filter 
2150, which proceeds to smooth the gain using a linear or 
non-linear filter, adopts this as a first smoothed gain 

25 g e x c , 1 (J) an d outputs to the excitation-signal reconstruction 
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circuit 2610. Here use is made of a filter represented by 
Equation (24) below. 

gexc (n) = r 21 • g exCil (n - 1) + (1 - r 21 ) ■ g exc (n) ■ ■ -(24) 

where - g exCi1 (-1) corresponds to - g exCil (N ssfr N sfr -1) in the 
5 preceding frame. Further, it is assumed that r 21 =0. 9 holds. 

The gain g exc (j) (where j=0, N ssfr - N sfr — 1) output from 
the first changeover circuit 2110 is input to the second filter 
2160, which proceeds to smooth the gain using a linear or 
non-linear filter, adopts this as a second smoothed gain 
10 Sexc. 2 (J) and outputs to the excitation-signal reconstruction 
circuit 2610. Here use is made of a filter represented by 
Equation (25) below. 

gexc,2 (n) = r 22 • g exc , 2 (n - 1) + (1 - r 22 ) • g exc (n) ■ ■ -(25) 

where g e x c , z corresponds to g exc , 2 (N ssfr -N sfr -1) in the 
15 preceding frame. Further, it is assumed that r 2 2 =0. 9 holds. 

The gain G exc (j) (where j =0, ■•■ , N s , f r ■ N s , r -1 ) output from 

the first changeover circuit 2110 is input to the third filter 

2170, which proceeds to smooth the gain using a linear or 

non-linear filter, adopts this as a third smoothed gain ~~ 
20 g e x c . 3 (i) an d outputs to the excitation-signal reconstruction 

circuit 2610. Here it is assumed that g e x c , 3 (n) =g e x c (n) 

holds. 

Fig. 4 is a block diagram illustrating the construction of 
a speech signal decoding apparatus according to a fourth 
25 embodiment of the present invention. In the fourth embodiment, 
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as shown in Fig. 4, an input terminal 50 and a second changeover 
circuit 7110 are added to the arrangement of the first embodiment 
shown in Fig. 1 and the connections are changed accordingly. 
The added input terminal 50 and the second changeover circuit 
7110 will be described below. 

A changeover control signal enters from the input terminal 
50. The changeover control signal is input to the changeover 
circuit 7110 via the input terminal 50, and the second gain 
output from the second gain decoding circuit 1120 is input to 
the changeover circuit 7110. In accordance with the changeover 
control signal, the changeover circuit 7110 outputs the second 
gain to the second gain circuit 1130 or to the smoothing circuit 
1 320. 

Fig. 5 is a block diagram illustrating the construction of 
a speech signal decoding apparatus according to a fifth 
embodiment of the present invention. In the fifth embodiment, 
as shown in Fig. 5, the input terminal 50 and the second 
changeover circuit 7110 are added to the arrangement of the 
second embodiment shown in Fig. 2 and the connections are changed 
accordingly. The input terminal 50 and the second changeover 
circuit 7110 will be described below. 

A changeover control signal enters from the input terminal 
50. The changeover control signal is input to the changeover 
circuit 7110 via the input terminal 50, and the excitation vector 
output from the adder 1050 is input to the changeover circuit 
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7110. In accordance with the changeover control signal, the 
changeover circuit 7110 outputs the excitation vector to the 
synthesis filter 1040 or to the excitation-signal normalizing 
c i rcu i t 2510. 

Fig. 6 is a block diagram illustrating the construction of 
a speech signal decoding apparatus according to a sixth 
embodiment of the present invention. In the sixth embodiment, 
as shown in Fig. 6, the input terminal 50 and the second 
changeover circuit 7110 are added to the arrangement of the third 
embodiment shown in Fig. 3 and the connections are changed 
accordingly. The input terminal 50 and the second changeover 
circuit 7110 are identical with those described in the fifth 
embodiment of Fig. 5 and need not be described again. 

The speech signal encoder in the conventional speech signal 
encoding/decoding apparatus shown in Fig. 8 may used as the 
speech signal encoder in the speech signal encoding/decoding 
apparatus as a seventh embodiment of the present invention. 

The speech signal decoding apparatus in each of the 
foregoing embodiments of the present invention may be 
implemented by computer control using a digital signal processor 
or the like. Fig. 7 is a diagram schematically illustrating the 
construction of an apparatus for a case where the speech signal 
decoding processing of each of the foregoing embodiments is 
implemented by a computer in an eighth embodiment of the present 
invention. A computer 1 for executing a program that has been 
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read out of a recording medium 6 executes speech signal decoding 
processing for decoding information concerning at least a sound 
source signal, gain and linear prediction coefficients from a 
received signal, generatingan excitation signal and the linear 
prediction coefficients from the decoded information, and 
driving a filter, which is constituted by the linear prediction 
coefficients, by the excitation signal to thereby decode a 
speech signal. To this end, a program has been recorded on the 
recordingmedium 6. The program is for executing (a) process ing 
for performing smoothing using a past value of gain and 
calculating an amount of fluctuation between the original gain 
and the smoothed gain, and (b) processing for limiting the value 
of the smoothed gain in conformity with the value of the amount 
of fluctuation and decoding the speech signal us ing the smoothed, 
limited gain. This program is read out of the recording medium 
6 and stored in a memory 3 via a recording-medium read-out unit 
5 and an interface 4, and the program is executed. The program 
may be stored in amask ROM or the likeor inanon-volatilememory 
such as a flash memory. Besides a non-volatile memory, the 
recording medium may be a medium such as a CD-ROM, floppy disk, 
DVD (Digital Versatile Disk) or magnetic tape. In a case where 
the program is transmitted by a computer from a server to a 
communication medium, the recording medium would include the 
communication medium to which the program is communicated by 
wire or wireless! y. 
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The computer 1 for executing a program that has been read 
out of a recording medium 6 executes speech signal decoding 
processing for decoding information concerning an excitation 
signal and linear prediction coefficients from a received signal, 
5 generating the excitation signal and the linear prediction 
coefficients from the decoded information, and driving a fi Iter, 
which is constituted by the linear prediction coefficients, by 
the excitation signal to thereby decode a speech signal. To 
this end, a program has been recorded on the recording medium 

10 6. The program is for executing (a) processing for calculating 
a norm of the excitation signal at regular intervals and 
smoothing the norm using a past value of the norm; and (b) 
processing for limiting the value of the smoothed norm using an 
amount of fluctuation calculated from the norm and the smoothed 

15 norm, changing the amplitude of the excitation signal in the 
intervals using the norm and the norm that has been smoothed and 
limited, and driving the filter by the excitation signal the 
amplitude of which has been changed. 

The computer 1 for executing a program that has been read 

20 out of a recording medium 6 executes speech signal decoding 
processing for decoding information concerning an excitation 
signal and linear prediction coefficients from a received signal, 
generating the excitation signal and the linear prediction 
coefficients from the decoded information, and driving a fi Iter, 

25 which is constituted by the linear prediction coefficients, by 
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the excitation signal to thereby decode a speech signal. To 
this end, a program has been recorded on the recording medium 
6. The program is for executing (a) processing for identifying 
a voiced segment and a noise segment with regard to the received 
signal using the decoded information; (b) processing for 
calculating a norm of the excitation signal at regular intervals 
in the noise segment, smoothing the norm using a past value of 
the norm and limiting the value of the smoothed norm using an 
amount of fluctuation calculated from the norm and the smoothed 
norm; (c) processing for changing the amplitude of the 
excitation signal in the intervals using the norm and the norm 
that has been smoothed and limited, and driving the filter by 
the excitation signal the amplitude of which has been changed. 

Thus, in accordance with the present invent ion as described 
above, it is possible to suppress the occurrence of abnormal 
sound in noise segments, such sound being caused when, in the 
smoothing of sound source gain (second gain), the sound source 
gain smoothed in a noise segment takes on a value much larger 
than that of the sound source gain before smoothing. 

The reason for this effect is that the values which the 
smoothed sound source gain is capable of taking on are limited 
on the basis of amount of fluctuation, which is calculated using 
the difference between smoothed sound source gain and the sound 
source gain before smoothing, in such a manner that sound source 
gain that has been smoothed in a noise interval will not take 
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on a very large value in comparison with the sound source gain 
before smoothing. The entire disclosure of References 1,2,3 
and 4 is herein incorporated by reference thereto as the 
components and/or processings making up parts of the present 
invention, as far as these relate to the implementation of the 
present invention. The same applies to the disclosure of 
Reference 5. 

As many apparently widely different embodiments of the 
present invention can be made without departing from the spirit 
and scope thereof, it is to be understood that the invention is 
not limited to the specific embodiments thereof except as 
defined in the appended claims. 

It should be noted that other objects, features and aspects 
of the present invention will become apparent in the entire 
disclosure and that modifications may be done without departing 
the gist and scope of the present invention as disclosed herein 
and claimed as appended herewith. 

Also it should be noted that any combination of the 
disclosed and/or claimed elements, matters and /or items may fal I 
under the modifications aforementioned. 
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WHAT IS CLAIMED IS: 

1. A speech signal decoding method for decoding information 
concerning at least a sound source signal, gain and linear 
prediction coefficients from a received signal, generating an 
excitation signal and linear prediction coefficients from 
decoded information, and driving a filter, which isconstituted 
by the linear prediction coefficients, by the excitation signal 
to thereby decode a speech signal, comprising: 

a first step of smoothing the gain using a past value of 
the ga i n ; 

a second step of limiting the value of the smoothed gain 
based upon an amount of fluctuation calculated from the gain and 
the smoothed gain; and 

a third step of decoding the speech signal using the gain 
that has been smoothed and limited. 

2. A speech signal decoding method for decoding information 
concerning an excitation signal and linear prediction 
coefficients from a received signal, generating an excitation 
signal and linear prediction coefficients from the decoded 
information, and driving a filter, which is constituted by the 
linear prediction coefficients, by the excitation signal to 
thereby decode a speech signal, comprising: 

a first step of deriving a norm of the excitation signal 
at regu I ar i nterva I s ; 

a second step of smoothing the norm using a past value of 
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the norm; 

a third step of limiting the value of the smoothed norm 
based upon an amount of fluctuation calculated from the norm and 
the smoothed norm; 

a fourth step of changing the amplitude of the excitation 
signal in said intervals using said norm and the norm that has 
been smoothed and limited; and 

a fifth step of driving the filter by the excitation signal 
the amplitude of which has been changed. 

3. A speech signal decoding method for decoding information 
concerning an excitation signal and linear prediction 
coefficients from a received signal, generating the excitation 
signal and the linear prediction coefficients from the decoded 
information, and driving a filter, which is constituted by the 
linear prediction coefficients, by the excitation signal to 
thereby decode a speech signal, comprising: 

a first step of identifying a voiced segment and a noise 
segment with regard to the received signal using the decoded 
i nf ormat i on ; 

a second step of deriving a norm of the excitation signal 
at regular intervals in the noise segment; 

a third step of smoothing the norm using a past value of 
the norm; 

a fourth step of limiting the value of the smoothed norm 
based upon an amount of fluctuation derived from the norm and 
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the smoothed norm; 

a fifth step of changing the amplitude of the excitation 
signal in said intervals using the norm and the norm that has 
been smoothed and limited; and 

a sixth step of driving the filter by the excitation signal 
the amplitude of which has been changed. 

4. The method according to claim 1, wherein the amount of 
fluctuation is represented by dividing an absolute value of a 
difference between the gain and the smoothed gain by the gain, 
and the value of the smoothed gain is limited in such a manner 
that the amount of fluctuation will not exceed a predetermined 
threshold value. 

5. The method according to claim 2, wherein the amount of 
fluctuation is represented by dividing an absolute value of a 
difference between the norm and the smoothed norm by the norm, 
and the value of the smoothed norm is limited in such a manner 
that the amount of fluctuation will not exceed a predetermined 
thresho I d value. 

6. The method according to claim 3, wherein the amount of 
fluctuation is represented by dividing an absolute value of a 
difference between the norm and the smoothed norm by the norm, 
and the value of the smoothed norm is limited in such a manner 
that the amount of fluctuation will not exceed a predetermined 
thresho Id va I ue. 

7. The method according to claim 2, wherein the excitation 
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signal in said intervals is divided by the norm in said intervals 
and the quotient is multiplied by the smoothed norm in said 
intervals to thereby change the amplitude of the excitation 
s i gna I . 

8. The method according to claim 3, wherein the excitation 
signal in said intervals is divided by the norm in said intervals 
and the quotient is multiplied by the smoothed norm in said 
intervals to thereby change the amplitude of the excitation 

s i gna I . 

9. The method according to claim 1, wherein switching between 
use of the gain and use of the smoothed gain is performed in 
accordance with an entered switching control signal when the 
speech signal is decoded. 

10. The me thod acco r d i ng c I a i m 2, where i n swi tch i ng between use 
of the excitation signal and use of the excitation signal the 
amplitude of which has been changed is performed in accordance 
with an entered switching control signal when the speech signal 
is decoded. 

11. The method accord i ng c I a i m 3, where i n swi tch i ng between use 
of the excitation signal and use of the excitation signal the 
amplitude of which has been changed is performed in accordance 
with an entered switching control signal when the speech signal 
is decoded 

12. A speech signal encoding and decodingmethod com prising the 
steps of : 
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encoding an input speech signal by expressing the input 
speech signal by an excitation signal and linear prediction 
coef f i c i ents ; and 

performing decoding by the speech signal decoding method 
set forth in claim 1. 

13. A speech signal encoding and decoding method comprising the 
steps of : 

encoding an input speech signal by expressing the input 
speech signal by an excitation signal and linear prediction 
coefficients; and 

performing decoding by the speech signal decoding method 
set forth in claim 2. 

14. A speech signal encoding and decodingmethod comprising the 
steps of : 

encoding an input speech signal by expressing the input 
speech signal by an excitation signal and linear prediction 
coefficients; and 

performing decoding by the speech signal decoding method 
set forth in claim 3 

15. A speech signal decoding apparatus for decoding 
information concerning at least a sound source signal, gain and 
linear prediction coefficients from a received signal, 
generating an excitation signal and linear prediction 
coefficients from the decoded information, and drivinga filter, 
which is constituted by the linear prediction coefficients, by 
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the excitation signal to thereby decode a speech signal, 
comprising: 

a smoothing circuit smoothing the gain using a past value 
of the ga i n ; and 

a smooth i ng-quant i ty limiting circuit limiting the value 
of the smoothed gain based upon an amount of fluctuation 
calculated from the gain and the smoothed gain. 
16. A speech signal decoding apparatus for decoding 
information concerning an excitation signal and linear 
prediction coefficients from a received signal, generating the 
excitation signal and linear prediction coefficients from the 
decoded information, and driving a filter, which is constituted 
by the linear prediction coefficients, by the excitation signal 
to thereby decode a speech signal, comprising: 

an excitation-signal normalizing circuit deriving a norm 
of the excitation signal at regular intervals and dividing the 
excitation signal by the norm; 

a smoothing circuit smoothing the norm using a past value 
of the norm; 

a smooth i ng-quant i ty limiting circuit limiting the value 
of the smoothed norm based upon an amount of fluctuation 
calculated from the norm and the smoothed norm; and 

an excitation-signal reconstruction circuit multiplying 
the smoothed and I i m i ted norm by the excitation signal to thereby 
change the amplitude of the excitation signal in said intervals. 
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17. A speech signal decoding apparatus for decoding 
information concerning an excitation signal and linear 
prediction coefficients from a received signal, generating the 
excitation signal and linear prediction coefficients from the 
decoded information, and driving a filter, which is constituted 
by the linear prediction coefficients, by the excitation signal 
to thereby decode a speech signal, comprising: 

a voiced/unvoiced identification circuit identifying a 
voiced segment and a noise segment with regard to the received 
signal using the decoded information; 

an excitation-signal normalizing circuit deriving a norm 
of the excitation signal at regular intervals and dividing the 
excitation signal by the norm; 

a smoothing circuit smoothing the norm using a past value 
of the norm; 

a smooth i ng-quant i ty limiting circuit limiting the value 
of the smoothed norm based upon an amount of fluctuation 
calculated from the norm and the smoothed norm; and 

an excitation-signal reconstruction circuit multiplying 
the smoothed and limited norm by the excitation signal to thereby 
change the amplitude of the excitation signal in said intervals. 
18. The apparatus according to claim 15, wherein the amount of 
fluctuation is represented by dividing an absolute value of a 
difference between the gain and the smoothed gain by the gain, 
and the value of the smoothed gain is limited in such a manner 
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that the amount of fluctuation will not exceed a predetermined 
thresho I d value. 

19. The apparatus according to claim 16, wherein the amount of 
fluctuation is represented by dividing the absolute value of the 
difference between the norm and the smoothed norm by the norm, 
and the value of the smoothed norm is limited in such a manner 
that the amount of fluctuation will not exceed a predetermined 
thresho I d value. 

20. The apparatus according to claim 17, wherein the amount of 
fluctuation is represented by dividing the absolute value of the 
difference between the norm and the smoothed norm by the norm, 
and the value of the smoothed norm is limited in such a manner 
that the amount of fluctuation will not exceed a predetermined 
threshold value. 

21. The apparatus according to claim 15, wherein the apparatus 
comprises a switching circuit in which switching between use of 
the gain and use of the smoothed gain is performed in accordance 
with an entered switching control signal when the speech signal 
is decoded. 

22. The apparatus according to claim 16, wherein the apparatus 
comprises a switching circuit in which switching between use of 
the excitation signal and use of the excitation signal the 
amplitude of which has been changed is performed in accordance 
with an entered switching control signal when the speech signal 
is decoded. 
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23. The apparatus according to claim 17, wherein the apparatus 
comprises a switching circuit in which switching between use of 
the excitation signal and use of the excitation signal the 
amplitude of which has been changed is performed in accordance 

5 with an entered switching control signal when the speech signal 
is decoded. 

24. A speech signal encoding and decoding apparatus 
comp r i s i ng : 

a speech signal encoder encoding an input speech signal by 
expressing the input speech signal by an excitation signal and 
5 linear prediction coefficients; and 

the speech signal decoding apparatus set forth in claim 15. 

25. A speech signal encoding and decoding apparatus 
comp r i s i ng : 

a speech signal encoder encoding an input speech signal by 
expressing the input speech signal by an excitation signal and 
5 linear prediction coefficients; and 

the speech signal decoding apparatus set forth in claim 16. 

26. A speech signal encoding and decoding apparatus 
comprising: 

a speech signal encoder encoding an input speech signal by 
expressing the input speech signal by an excitation signal and 
5 linear prediction coefficients; and 
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the speech signal decoding apparatus set forth in claim 17. 

27. A program product for causing a computer to execute 
processing (a) and (b) below, wherein the computer constitutes 
a speech signal decoding apparatus for decoding information 
concerning at least a sound source signal, gain and linear 
prediction coefficients from a received signal, generating an 
excitation signal and linear prediction coefficients from the 
decoded information, and driving a filter, which is constituted 
by the linear prediction coefficients, by the excitation signal 
to thereby decode a speech signal: 

(a) processing of performing smoothing using a past value 
of a gain and calculating an amount of fluctuation between the 
gain and a smoothed gain; and 

(b) processing of limiting the value of the smoothed gain 
in conformity with the value of the amount of fluctuation and 
decoding the speech signal using the smoothed, limited gain. 

28. A program product for causing a computer to execute 
processing (a) to (c) below, wherein the computer constitutes 
a speech signal decoding apparatus for decoding information 
concerning an excitation signal and linear prediction 
coefficients from a received signal, generating an excitation 
signal and linear prediction coefficients from the decoded 
information, and driving a filter, which is constituted by the 
linear prediction coefficients, by the excitation signal to 
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thereby decode a speech signal: 

(a) processing of calculating a norm of an excitation 
signal at regular intervals and smoothing the norm using a past 
va I ue of the norm ; 

(b) processing of limiting the value of the smoothed norm 
in conformity with the value of an amount of fluctuation 
calculated from the norm and the smoothed norm; and 

(c) processing of changing the amplitude of the excitation 
signal in said intervals using the norm and the norm that has 
been smoothed and limited, and driving the filter by the 
excitation signal the amplitude of which has been changed. 
29. A program product for causing a computer to execute 
processing (a) to (d) below, wherein the computer constitutes 
a speech signal decoding apparatus for decoding information 
concerning an excitation signal and linear prediction 
coefficients from a received signal, generating an excitation 
signal and linear prediction coefficients from the decoded 
information, and driving a filter, which is constituted by the 
linear prediction coefficients, by the excitation signal to 
thereby decode a speech signal: 

(a) processing of identifying a voiced segment and a noise 
segment with regard to a received signal using decoded 
information; 

(b) processing of calculating a norm of an excitation 
signal at regular intervals in the noise segment and smoothing 
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the norm using a past value of the norm; 

(c) processing of limiting the value of the smoothed norm 
in conformity with an amount of fluctuation calculated from the 
norm and the smoothed norm; and 

(d) processing of changing the amplitude of the excitation 
signal in said intervals using the norm and the norm that has 
been smoothed and limited, and driving the filter by the 
excitation signal the amplitude of which has been changed. 

30. The program product according to claim 27, wherein said 
program product comprises a program for processing of 
representing the amount of fluctuation by dividing an absolute 
value of a difference between the gain and the smoothed gain by 
the gain, and limiting the value of the smoothed gain in such 
a manner that the amount of fluctuation will not exceed a 
predetermined threshold value. 

31. The program product according to claim 28, wherein said 
program product comprises a program for processing of 
representing the amount of fluctuation by dividing an absolute 
value of a difference between the norm and the smoothed norm by 
the norm, and limiting the value of the smoothed norm in such 
a manner that the amount of fluctuation will not exceed a 
predetermined threshold value. 

32. The program product according to claim 29, wherein said 
program product comprises a program for processing of 
representing the amount of fluctuation by dividing an absolute 
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value of a difference between the norm and the smoothed norm by 
5 the norm, and limiting the value of the smoothed norm in such 
a manner that the amount of fluctuation will not exceed a 
predetermined threshold value. 

33. The program product according to claim 28, wherein said 
program product comprises a program for processing of dividing 
the excitation signal in said intervals by the norm in said 
intervals and multiplying the quotient by the smoothed norm in 

5 said intervals to thereby change the amp I itude of the excitation 
s i gna I . 

34. The program product according to claim 29, wherein said 
program product comprises a program for processing of dividing 
the excitation signal in said intervals by the norm in said 
intervals and multiplying the quotient by the smoothed norm in 

5 said intervals to thereby change the ampl itude of the excitation 
s i gna I . 

35. The program product according to claim 27, wherein said 
program product comprises a program for processing of switching 
between use of the gain and use the smoothed gain in accordance 
with an entered switching control signal when the speech signal 

5 is decoded. 

36. The program product according to claim 28, wherein said 
program product comprises a program for processing of switching 
between use of the excitation signal and use of the excitation 
signal theamplitudeofwhichhasbeenchanged inaccordancewith 
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an entered switching control signal when the speech signal is 
decoded. 

37. The program product according to claim 29, wherein said 
program product comprises a program for processing of switching 
between use of the excitation signal and use of the excitation 
signal the amplitude of which has been changed in accordance with 
an entered switching control signal when the speech signal is 
decoded. 

38. A program product comprising a program for causing said 
computer to execute processing of performing decoding by the 
speech signal decoding method set forth in claim 1, when an input 
speech signal has been encoded by expressing the input speech 
signal by an excitation signal and linear prediction 

coef f i c i ents. 

39. A program product comprising a program for causing said 
computer to execute processing of performing decoding by the 
speech signal decoding method set forth in claim 2, when an input 
speech signal has been encoded by expressing the input speech 
signal by an excitation signal and linear prediction 

coef f i c i ents. 

40. A program product comprising a program for causing said 
computer to execute processing of performing decoding by the 
speech signal decodingmethod set forth in claim 3, when an input 
speech signal has been encoded by expressing the input speech 
signal by an excitation signal and linear prediction 
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coefficients. 

41. A speech signal decoding apparatus comprising: 

(a) a code input circuit splitting code of a bit sequence 
of an encoded input signal that enters from an input terminal, 
converting the code to indices that correspond to a plurality 
of decode parameters, outputting an index corresponding to a 
line spectrum pair, termed hereinafter "LSP", which represents 
the frequency characteristic of the input signal, to an LSP 
decoding circuit, outputting an index corresponding to a delay 
that represents a pitch period of the input signal to a pitch 
signal decoding circuit, outputting an index corresponding to 
a sound source vector comprising a random number or a pulse train 
to a sound source signal decoding circuit, outputting an index 
corresponding to a first gain to a first gain decoding circuit, 
and outputting an index corresponding to a second gain to a 
second gain decoding circuit; 

(b) an LSP decoding circuit, to which the index output from 
said code input circuit is input, and which reads the LSP 
corresponding to the input index out of a table which stores LSPs 
corresponding to indices, obtains an LSP in a subframe of the 
present frame and outputs the LSP; 

(c) a linear prediction coefficient conversion circuit, to 
which the LSP output from said LSP decoding circuit is input, 
andwhich converts the LSP to linear prediction co efficients and 
outputs the coefficients to a synthesis filter; 
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(d) a sound source signal decoding circuit, to which the 
index output from said code input circuit is input, and which 
reads a sound source vector corresponding to the index out of 
a table storing sound source vectors corresponding to indices, 
and outputs the sound source vector to a second gain decoding 
circuit; 

(e) a second gain decoding circuit, to which the index 
output from said code input circuit is input, and which reads 
a second gain corresponding to the input index out of a table 
storing second gains corresponding to indices, and outputs the 
second gain to a smoothing circuit; 

(f) a second gain circuit, to which a first sound source 
vector output from said sound source signal decoding circuit and 
the second gain are input, and which multiplies the first sound 
source vector by the second gain to generate a second sound 
source vector and outputs the generated second sound source 
vector to an adder; 

(g) a memory circuit holding an excitation vector input 
thereto from said adder and outputting a held excitation vector, 
which was input thereto in the past, to a pitch signal decoding 
circuit; 

(h) a pitch signal decoding circuit, to which the past 
excitation vector held by said memory circuit and the index 
output from said code input circuit are input, with said index 
specifying a delay, and which cuts out vectors of samples 
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corresponding to a vector length from a point previous to the 
starting point of the present frame by an amount corresponding 
to the delay to thereby generate a first pitch vector, and 
outputs the first pitch vector to a first gain circuit; 

(i) a first gain decoding circuit, towhichtheindexoutput 
from said code input circuit is input, and which reads a first 
gain corresponding to the input index out of a table storing 
first gains corresponding to indices, and outputs the first gain 
to a first gain ci rcu i t ; 

(j) a first gain circuit, to which the first pitch vector 
output f rom said pitch signal decod i ng circuit and the first ga in 
output from said first gain decoding circuit are input, andwhich 
multiplies the input first pitch vector by the first gain to 
generate a second pitch vector, and outputs the generated second 
pitch vector to said adder ; 

(k) an adder, to which the second pitch vector output from 
said first gain circuit and the second sound source vector output 
from said second gain circuit are input, and which calculates 
thesumof these inputs, andoutputs the sum to a syn thesis filter 
as an exc i tat i on vector ; 

(I) a smoothing coefficient calculation circuit, to which 
LSP output from said LSP decoding circuit is input, and which 
calculates average LSP in the present frame, finds the amount 
of fluctuation of the LSP with respect to each subframe, finds 
a smoothing coefficient in the subframe, and outputs the 
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75 smoothing coefficient to a smoothing circuit; 

(m) a smooth i ng c i rcu i t, to wh i ch the smooth i ng coeff i c i ent 
output from said smoothing coefficient calculation circuit and 
the second gain output from said second gain decoding circuit 
are input, and which finds an average gain from the second gain 

80 in the subframe, and outputs the second gain; 

(n) a synthesis filter, to which the excitation vector 
output from said adder and the linear prediction coefficients 
output from said linear prediction coefficient conversion 
c i rcu i t are i nput, and wh i ch dr i ves a synthes i s f i I ter, for that 

85 the linear prediction coefficients have been set, by the 

excitation vector to thereby calculate a reconstructed vector, 
and outputs the reconstructed vector from an output terminal; 
and 

(o) a smooth i ng-quant i ty limiting circuit, to which the 
90 second gain output from said second gain decoding circuit and 
the smoothed second gain output from said smoothing circuit are 
input, and which finds the amount of fluctuation between the 
smoothed second gain output from said smoothing circuit and the 
second gain output from said second gain decoding circuit, 
95 outputs the smoothed second gain to said second gain circuit as 
is when the amount of fluctuation is less than a predetermined 
threshold value, replaces the smoothed second gain with a 
smoothed second gain limited in terms of values it is capable 
of taking on when the amount of fluctuation is equal to or greater 
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than the threshold value, and outputs this smoothed second gain 
to said second gain circuit. 

42. The apparatus according to claim 41, further comprising: 
(p) an excitation-signal normali zing circuit, towhichan 
excitation vector in a subframe output from said adder is input, 
and which calculates gain and a shape vector from the excitation 
vector every subframe or every sub-subframe obtained by 
subdividing a subframe, outputs the gain to said smoothing 
circuit, and outputs the shape vector to an excitation-signal 
recons t ruct ion c i rcu i t ; and 

(q) an exc i tat i on-s i gna I r econ s t r u c t i on c i r cu i t, to which 
the gain output from said smooth i ng-quant i ty limiting circuit 
and the shape vector output from said excitation-signal 
normalizing circuit are input, and which calculates a smoothed 
excitation vector, and outputs this excitation vector to said 
memory circuit and to said synthesis filter; 

(r) wherein said smoothing circuit has the output of said 
excitation-si gnal normalizing circuit input thereto instead of 
the output of said second gain decoding circuit and has the 
output of said smoothing coefficient calculation circuit input 
thereto; 

(s) said smooth i ng-quant i ty limiting circuit has the 
smoothed gain output from said smoothing circuit applied to one 
input terminal thereof and has the gain output from said 
excitation-signal normalizing circuit, rather than the output 
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of said second gain decoding circuit, applied to the other input 
25 terminal thereof, finds the amount of fluctuation between the 
smoothed gain output from said smoothing circuit and the gain 
output from said excitation-signal normalizing circuit, 
supplies the smoothed gain as is to said excitation-signal 
reconstruction circuit when the amount of fluctuation is less 
30 than a predeterm i ned thresho I d va I ue, rep I aces the smoothed ga i n 
with a smoothed gain limited in terms of values it is capable 
of taking on when the amount of fluctuation is equal to or greater 
than the threshold value, and supplies this smoothed gain to the 

i excitation-signal reconstruction circuit; and 

-35 (t) theoutputofsaidsecondgaindecodingcircuit is input 

= to said second gain circuit as second gain. 

43. The apparatus according to claim 42, further comprising: 

- a power calculation circuit, to which the reconstructed 

vector output from said synthesis filter is input, and which 
calculates the sum of the squares of the reconstructed vector 
5 and outputting the power to a voiced/unvoiced identification 
circuit; 

a speech mode decision circuit, to which a past excitation 
vector held by said memory circuit and an index specifying a 
delay output from said code input circuit are input, and which 
10 calculates a pitch prediction gain in a subframe from the past 
excitation vector and the delay, determines a predetermined 
threshold valuewith respect to the pitch prediction gain orwith 
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respect to an in-frame average value of the pitch predict ion gain 
in a certain frame, and sets a speech mode; 

a voiced/unvoiced identification circuit, to which an LSP 
output from said LSP decoding circuit, the speech mode output 
from said speech mode decision circuit and the power output from 
said power calculation circuit are input, and which finds the 
amount of fluctuation of a spectrum parameter, identifying a 
voice segment and an unvoiced segment based upon the amount of 
fluctuation, and outputs amount-of-f I uctuat i on information and 
an i dent i f i cat ion flag; 

a noise classification circuit, to which the amount-of- 
fluctuation information and identification flag output from 
said voiced/unvoiced identification are input, and which 
classifies noise and outputting a classification flag; and 

a first changeover circuit, to which the gain output from 
sa i d exc i tat i on-s i gna I normalizing circuit, the identification 
flag output from said voiced/unvoiced identification circuit 
and the classification flagoutput from the noise classification 
circuit are input, and which changes over a switch in accordance 
with a value of the identification flag and a value of the 
classification flag to thereby switchingly output the gain to 
any one of a plurality of filters having different filter 
characteristics from one another; 

wherein the filter selected from among said plurality of 
filters has the gain output from said first changeover circuit 
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applied thereto, smoothes the gain using a linear filter or 
non-linear filter and outputs the smoothed gain to said 
40 smooth i ng-quant i ty limiting circuit as a first smoothed gain; 
and 

said smooth i ng-quan t i ty limiting circuit has the first 
smoothed gain output from the selected filter applied to one 
input terminal thereof, has the output of said excitation-signal 

45 normal izing circuit applied to the other input terminal thereof, 
finds the amount of fluctuation between the gain output from said 
excitation-signal normalizing circuit and the first smoothed 
gain output from said selected filter, uses the first smoothed 
gain as is when the amount of fluctuation is less than a 

50 predeterm i ned thresho I d va I ue, rep I aces the f i rs t smoothed ga i n 
with a smoothed gain limited in terms of values it is capable 
of takingonwhen the amount offluctuation is equal to or greater 
than the threshold value, and suppl ies this smoothed gain to said 
excitation-signal reconstruction circuit. 

44. The apparatus according to claim 41, further comprising a 
changeover circuit sw itching between a mode of using of the gain 
and a mode of using the smoothed gain as the input to said second 
gain circuit in accordance with a switching control signal, 

5 which has entered from an input terminal, when the speech signal 
is decoded. 

45. The apparatus according to claim 42, further comprising a 
changeover circuit to which the excitation vector output from 
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aid adder is input, and which outputs the excitation vector to 
said synthesis filter or to said excitation-signal normalizing 
5 circuit in accordance with a changeover control signal, that has 
entered from an input terminal. 

46. The apparatus according to claim 43, further comprising a 
changeover circuit to which the excitation vector output from 
aid adder is input, and which outputs the excitation vector to 
said synthesis filter or to said excitation-signal normalizing 
5 c i rcu i t i n accordance wi th a changeove r con t ro I s i gna I , that has 
entered from an input terminal. 
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ABSTRACT OF THE DISCLOSURE 
The quality of reconstructed speech on which background 
noise is superimposed is improved in a speech signal decoding 
apparatus for generating a speech signal by driving a filter, 
5 which is constituted by linear prediction coefficients, by an 
excitation signal. A smoothing circuit smoothes sound source 
gain in a noise segment using sound source gain that was obtained 
in the past. A smooth i ng-quan t i ty limiting circuit calculates 
an amount of fluctuation represented by dividing, by the sound 
10 source gain, the absolute value of the difference between the 
sound source gain and the sound source gain that has been 
smoothed, and limits the value of the smoothed gain in such a 
manner that the amount of fluctuation will not exceed a certain 
threshold value. 
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